Fitting Logistie IRT Modeis: Small Wonder

نویسنده

  • Miguel A. García-Pérez
چکیده

Smte-oi-rbe-art ítem response theo,y (IRT) models use logistie functions exclusivcly as their ítem response functions (IREs). Logistie funcííons mee Ihe reqrtírements thai iheir range is che unit iníerval ami íhat they are monotonically increasing. bur they upase a parameter ipace whose dimensions can only be assigned a meíaphoñcal interpretation in the context of cesting. Applications of 1RT models require obtaining ihe set of values itt Ingístie functíon paranieters thai besí fu an empirícal dato set. l-lowever. success in ohtaininsi such set of values does ncc guarantee thai che constructs they represent actually exisí, for the adequacy of a model is nol sustained hy ihe possihility of estimaring parameters. Thís article illustrates bow meehanical adoplion of off-theshelf logistic lunc[ions as IRFs lbr IRT modeis can result in off-the-shelf parameter estímates and tUs to dato. Thc results of a simulation study are presenced, which show that logistie 11ff modeis can tít a set of dato generated by IREs other chan logiscic funccions just as well as they ¡it iogisric dato. even though the response processes and parameter spaces involved in each case are subsíantially dírferení. An explanation of why logistic funetions wnrk os lhey do is ofreced, che íhcorecical and practícal consequences of Iheir behavior are discuised, and a testable altemative lo logistic IRFs is commented cpon. Key ~vúnIs.goodnass ej/ii, paran cier esíimwíou, ile»> nxsponse theory, legisle: models, Jiniie siate poIynrníur: ,nodels, BILGC

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Beyond Majority Voting: Generating Evaluation Scales using Item Response Theory

We introduce Item Response Theory (IRT) from psychometrics as an alternative to majority voting to create an IRT gold standard (GSIRT ). IRT describes characteristics of individual items in GSIRT their difficulty and discriminating power and is able to account for these characteristics in its estimation of human intelligence or ability for an NLP task. In this paper, we evaluated IRT’s model-fi...

متن کامل

IRT-FIT: SAS® Macros for Fitting Item Response Theory (IRT) Models

Psychometrics has recently seen the development of complex measurement models to better represent test and item data. Item Response Theory (IRT), in particular, comprises a set of non-linear latent variable models that appear to have several conceptual and empirical properties that make them more valuable in practice than classical test theory methods. However, IRT-based models typically requir...

متن کامل

Identifying DIF for Latent Classes with the Dirichlet Process

of the Dissertation Identifying DIF for Latent Classes with the Dirichlet Process by Miles Satori Chen Doctor of Philosophy in Statistics University of California, Los Angeles, 2015 Professor Peter Bentler, Chair In Item Response Theory (IRT), Differential Item Functioning (DIF) occurs when individuals who have the same ability, but belong to different groups, have different probabilities of an...

متن کامل

Building an Evaluation Scale using Item Response Theory

Evaluation of NLP methods requires testing against a previously vetted gold-standard test set and reporting standard metrics (accuracy/precision/recall/F1). The current assumption is that all items in a given test set are equal with regards to difficulty and discriminating power. We propose Item Response Theory (IRT) from psychometrics as an alternative means for gold-standard test-set generati...

متن کامل

Assessing IRT Model-Data Fit for Mixed Format Tests

This study examined various model combinations and calibration procedures for mixed format tests under different item response theory (IRT) models and calibration methods. Using real data sets that consist of both dichotomous and polytomous items, nine possibly applicable IRT model mixtures and two calibration procedures were compared based on traditional and alternative goodnessof-fit statisti...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003